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When the controlled system is subject to parameter variations and external 
disturbances, a fixed-parameter proportional integral derivative (PID) 
controller cannot ensure its stabilization. In this case, its control requires 
online parameter adjustment. Specifically, as the quadrotor is a multi-input 
multi-output, nonlinear, and underactuated system, robust control is 
necessary to ensure efficient trajectory tracking flights. In this paper, an 
adaptive proportional integral derivative (APID) controller is proposed to 
control the quadrotor systems. This APID-based control strategy uses a two 
hidden layer deep feedforward network (DFN), where the one-step secant 
algorithm is chosen for initializing the DFN parameters. All the design steps 
of the proposed adaptive controller are described. The multidimensional 
particle swarm optimization (PSO) algorithm is used for tuning the DFN 
parameters. Then, using two simulation efficiency tests, a comparison 
between the proposed PSO-based APID-DFN, the (non-optimized) 
APID-DEN, the feedforward network APID, and the fixed-parameter PID 
controllers proves much efficiency of the proposed adaptive controller. The 
results illustrate that the proposed PSO-based APID-DFN controller can 
ensure good quadrotor system stabilization and achieve minimum overshoot 
and faster convergence speed for all quadrotor motions. Thus, the proposed 
control strategy could be considered an additional intelligent method-based 
adaptive control for unmanned aerial vehicles. 


This is an open access article under the CC BY-SA license. 


Corresponding Author: 
El Ayachi Chater 


LASTIMI Lab., Higher School of Technology, Mohammed V University 


Rabat, Morocco 


Email: elayachi.chater @est.um5.ac.ma 


1. INTRODUCTION 


Over the last years, the research control community has shown an increasing interest in flying 
vehicles without onboard human pilots known as unmanned aerial vehicles (UAVs). UAVs’ civilian 
applications have increased in diverse fields due to their low cost. Their application ranges from homeland 
security, disaster relief, and weather forecasting to power line inspection and precision agriculture [1]—[3]. 
Besides, as in the outdoors environment, the aerial vehicles are exposed to adverse atmospheric conditions, 
reliable and robust control strategies are necessary. 

Many papers deal with the control problem of the multi-rotor UAV systems. Some of these works 
propose the development of linear controllers like the proportional integral derivative (PID) and 
linear-quadratic regulators (LQR) [4], [5]. Besides, other works suggest the development of nonlinear 
methods to ensure UAV system stability. Among the latter, we can list the backstepping control approach [6], 
[7] and the sliding mode approaches [8], [9]. In addition, intelligent control strategies have been vastly used 
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to stabilize the quadrotor system, such as fuzzy logic control [10]-[12], artificial neural network (ANN) [13], 
and adaptive fuzzy inference system [14], [15]. Specifically, due to their simplicity and high reliability, the 
proportional integral derivative (PID) controllers have been widely used in industrial systems. However, the 
conventional PID controllers are not always satisfying in nonlinear systems such as the highly coupled 
quadrotor system. 

Several research works are still proposed for stabilizing and controlling the quadrotor system flight. 
For instance, a real-time experimental test is implemented to evaluate the PID controller for the quadrotor 
system. First, the PID controller parameters were selected to have the desired energy consumption [16]. 
Then, a PID flight controller for a small aerial vehicle was presented in [17], where the proposed algorithm 
was deployed using Bluetooth® low energy connection via a personal area network (PAN). Besides, a design 
and implementation solution that utilizes low-cost components with a PID control for a quadrotor system was 
proposed with acceptable experimental test results [4]. 

Indeed, the most crucial step for the PID controller design is finding appropriate gains that permit 
the stabilization of the nonlinear system despite system parameter variation and external disturbances. Thus, 
several methods are utilized for tuning the PID control gains. For instance, the PID parameters could be 
determined using the classical Zeigler-Nichols (ZN) method with tests for attitude stabilization of a quadrotor 
system. However, despite the easy implementation of the ZN method, it could generally generate an 
unacceptable overshoot. Therefore, optimization techniques are usually used to (offline) tune the PID 
controller gains according to a fitness function. Among these techniques, particle swarm optimization (PSO) 
[18], genetic algorithm (GA) [19], and ant colony optimization (ACO) are usually utilized [20]. These 
techniques are shown to be better than the ZN method. However, they only provide constant control 
parameters, which could lead to small control quality in the presence of external disturbances. To overcome 
this drawback, the online tuning of the PID control parameters is more effective. Mainly, combining the PID 
control strategy with intelligent techniques improves the quadrotor control performances. Thus, several 
recent published works have incorporated intelligent algorithms to ensure online PID parameter adjustment, 
such as fuzzy logic control (FLC) [21], [22] and neural networks (NN) [23]-[25]. 

Artificial neural networks (ANNs) have been widely studied due to their system identification and 
control design advantages. The deep neural network (DNN) is primarily defined as an exciting area of neural 
network (NN), which has gained popularity in recent years. The architecture of DNN is based on a 
multi-layer NN that contains multiple hidden layers. A typical model of DNN is the deep feedforward 
network (DFN). This self-learning algorithm allows filtering information through multiple hidden layers in 
forwarding directions using the ANN algorithm. Significantly, the DNN has enabled significant progress in 
sound and image processing applications, including feature detection, facial recognition, object 
identifications, computer vision, and text classification [26]—[28]. Besides, potential applications of DNN are 
numerous in control system engineering [29]. NN and similar approaches such as DNN and DEN can provide 
better results when used to online tune the PID controller parameters [13], [23]-[25] for controlling the 
quadrotor system. For instance, an ANN was utilized to adjust the PID parameters [23], where a comparative 
study of three types of training methods (Bayesian regularization, Levenberg-Marquardt, and scaled 
conjugate gradient) was performed to minimize the mean square error (MSE) when controlling the roll, pitch, 
yaw, and altitude of a quadrotor system. Then, as the proposed adaptive ANN-based PID can improve the 
quadrotor tracking performances, a decentralized PID neural network (PIDNN) control scheme was proposed 
to stabilize a quadrotor’s attitude in the presence of the Dryden model of wind disturbance. Simultaneously, a 
conventional PID controller was utilized in the outer loop to generate the inner-loop reference path [24]. 
Then, it was shown that this controller could reject the external disturbances with good stability. An adaptive 
PID controller was also proposed to stabilize a quadrotor system’s attitude and position with unknown 
variable payloads. It was shown that the neural network could deal with the unknown variable payload by 
online tuning of the PID control parameters [25]. A comparative analytical method has also shown the 
advantages of ANNs over a UAV attitude controller’s PID conventional control method [13]. 

Inspired by the above works and considering their result limitations, this paper aims to present a 
new control strategy based on an adaptive PID deep feedforward network (APIDDEN) for quadrotor UAVs. 
Specifically, the proposed learning control solution uses: i) deep feedforward network composed of two 
hidden layers, ii) the one-step secant (OSS) method selects the deep feedforward network’s initial weights 
and bias, iii) a multidimensional particle swarm optimization (PSO) algorithm to adjust the elaborated deep 
feedforward network (DFN), and iv) a quadrotor (model-based) simulation setup in Dryden wind turbulence 
and assuming a payload mass change during the vehicle motions. It is worth noticing that the system model is 
always highly simplified, the relation between the body-fixed and earth-fixed velocities is omitted, and 
external disturbances are ignored in almost previous works related to controlling the quadrotor UAVs. No 
simplifications are used in this work. 


Int J Elec & Comp Eng, Vol. 12, No. 4, August 2022: 3607-3619 


Int J Elec & Comp Eng ISSN: 2088-8708 O 3609 


Our new result may be summarized as follows: using the DFN learning algorithm, a performance 
evaluation obtained through analyses and a model-based simulation setup shows that combining a PID 
controller with the DFN algorithm allows designing an efficient adaptive controller that can stabilize the 
aerial vehicle Euler angles and guarantee a perfect trajectory tracking for arbitrary linear position reference 
trajectory, despite payload mass variations and external wind disturbances. Thus, the proposed control 
strategy could be considered an additional controller for nonlinear highly coupled quadrotor UAVs. 

This paper is organized as follows: the underactuated quadrotor dynamics, the Dryden wind model, 
and the APIDDEN control design are described in section 2. Then, section 3 presents the simulation results 
and discussion. Finally, section 4 concludes the paper. 


2. RESEARCH METHOD 

In this section, the APIDDFN controller design is presented for the quadrotor system in the presence 
of the Dryden wind disturbance and payload mass change. For clarity, the quadrotor dynamics and the 
Dryden wind turbulence models are first presented. Then, the whole control design steps are described in 
detail. 


2.1. Quadrotor dynamic modelling 

A quadrotor is equipped with four rotors: two rotors (1 and 3) rotate in a counterclockwise direction, 
while the other rotors (2 and 4) rotate in a clockwise direction. This four-rotor actuation can generate six 
motions: three linear translations (along the x, y, and z axes) and three angular rotations, namely the g-roll, 
8-pitch, and w-yaw Euler angles. Thus, the quadrotor is an underactuated highly coupled system with its six 
degrees of freedom (6DOF) and only four input actuators. 

Now, to establish the quadrotor model, two coordinate systems should be considered: an earth-fixed 
coordinate frame (F, = (O, Xe, Ye, Ze)), and a body-fixed coordinate frame (F, = (G, Xp, Yp, Zp)), whose 
origin is the vehicle center of gravity (CoG). For clarity, Figure 1 shows the quadrotor X structure. The 
quadrotor system model may be summarized as (1) [30]: 


x= (cosecy + Sosy) = + — fx 
Ý = (cosasy — Soy) $ + — fwy 
2 = coco =- g +— fyz A 
ond Shiva 4 
= ee a () 
b= rE op 
oe wyi 
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where x, y and z denote the cartesian three-space position, g, 8, and w denote the aerial vehicle orientation, 
T is the total thrust, m denotes the vehicle mass, g is the gravitational acceleration, jy, jy, and j, denote the 
moments of inertia along the three axes x, y, and z, respectively, Ty, Tg, and Ty denote the propeller torques, 
and fx, fwy, and fwz denotes the aerodynamic force. For simplicity, we adopted the following notations: 


Cp: = cos (f) , and sg := sin (F) , for p € eee 
2.2. The Dryden wind turbulence 

The turbulence process is stochastic and is expressed by velocity spectra [31]. In this work, we 
apply the most frequently used model called the Dryden turbulence model [32]. Without loss of generality, 
we admit that the turbulence field is fixed in time and space. However, we assume using a random, 
homogenous, and isotropic turbulent wind. The Dryden turbulence model is characterized by its power 
spectral densities (PSD). These power spectral densities correspond to the longitudinal, lateral, and vertical 
linear velocity components for the MIL-F-8785C model [31] and are defined by (2): 


20x? Ly 1 
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where the turbulence intensities (0,,, o, and gz) correspond to the turbulence magnitude in the longitudinal, 
lateral, and vertical axes, respectively. The turbulence scale length (Ly, L, and Lz) represent the turbulence 


®,(w) = Py(w) = (2) 
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field length in the longitudinal, lateral, and vertical axis, respectively. w is the time-frequency, and V is the 
air velocity. 

Implementing the Dryden wind model consists of passing the band-limited Gaussian white noise 
signal through a shaping filter function to generate an output signal with spectral properties. The turbulence 
signal must have the same frequency spectrum as the PSD of the Dryden wind. For this reason, we should 
use the following filters defined by (3) [33]: 


vBLy V3Lz 
2L 1 L 1+—;+;s L 1+ S 
H, (s) =o. =, H, (s) = o, |2. — H,(s) = o, |4.—* (3) 
x x TV 14 ets” y y TV (12s) 7 Z Z TV (1442s) 


Continuous Dryden filter is a low-pass filter, where any frequency higher than the cutoff frequency 
is eliminated. The filter cutoff frequency is given by the ratio of turbulence scale length to the airspeed. Then, 
according to [31], at low altitude, the turbulence scale lengths and intensities are defined by the following 
equation where W,, denotes the wind speed at the height of 6 meters, and h denotes the quadrotor altitude. 


h 0.1W20 
= ———————_. 0, = 0.1W. Oy = Oy, = ————_——_, 
Y ~ (0.177+0.000823h)12? Z 20 0% Y ~ (0.177+0.000823h)°-4 
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Figure 1. Quadrotor system X-structure with its coordinate frames 


2.3. Adaptive PID deep feedforward network design 

In this work, the control of the quadrotor system is based on an APIDDFN. This controller is 
designed by combining the conventional PID control strategy with the DFN algorithm. The DFN algorithm is 
the most common kind of neural network architecture mapping input to output through hidden nodes using a 
sequence of layered transformations that connect the neurons of each layer to those of the next layer. For 
clarity, algorithm 1 gives the main steps of the entire controller design process. Then, a description of this 
algorithm is given below. 


Algorithm 1. PSO-APIDDFN algorithm steps for controlling the quadrotor system 
1 Collect the quadrotor control system dataset 

2 Set the deep feedforward network (DFN) architecture 

3 Initialize the DFN parameters (weights and bias) 

4 Design the adaptive DFN controller 

5 Tune the DFN parameters using the multidimensional PSO algorithm 


The first step in the APIDDFN design is collecting the training data sets necessary to carry out the 
learning process. Indeed, we use the PID control strategy to collect the data set required to design the deep 
feedforward networks (DFNs), namely DFN,, DFNy, DFN}, DFN, DFNg and DFNy. The data sets 
collected from the quadrotor system response are composed of 2-dimensional vector [e U ]" where e is the 
controller error that denotes the difference between the actual and desired output value, and U is the PID 
controller output for the corresponding system state. 
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One of the essential features that affect the resultant network in the learning task is the network 
architecture, as the hidden layer number and neuron number in each layer are essential in the network design. 
One hidden layer and a few neurons number may be insufficient. However, multi-layer architecture with 
many neurons may imply a superior computation time and may not be more efficient for solving complex 
problems. 

In the proposed APIDDEN, each DEN has four layers: one input layer, two hidden layers, and one 
output layer. The input layer has two neurons corresponding to error signal e and the feedback value from the 
current system output. The output layer has three (two) neurons that consist of the kp, k; and kg (kp and ka) 
PID (PD) gains. The structure of the proposed deep feedforward network is shown in Figure 2. Roughly, each 
connection has weight and bias values that are represented by matrices of numbers, where In = [In, In,]" 
denotes the network input vector, and W*(b*), for k = {1,2,3}, denotes the matrix value that represents the 
weight (bias) of the connection between consecutive layers. The mathematical equations of the DFN 
algorithm can be described as follows: 

- Layer 0 (input layer): 


h? = In (5) 


where In is the network input vector. In this paper, as the input layer comprises two neurons, the input vector 
dimension is (2x1). 
- Layer 1 to layer 2 (hidden layers): 


hi = (bi +W'h'?) (6) 


For each hidden layer i, i € {1,2}, Wt is the layer weight that represents the strength of the 
connection between the neurons of the hidden layer i and the previous layer (i — 1), bÍ is the neuron bias 
added to produce the net input, and ® is the hidden layer activation function. Noting that, Wt is a (rxc) 
matrix and bÍ is a (rx1) vector, where r is the current layer neuron number, and c is the previous layer 
neuron number. 

In this work, Wt is a (3x2) matrix, bt and b? are a (3x1) vectors, W? is a (3x3) matrix. The 
activation function applied to each hidden layer is the sigmoid function, which is defined by (7): 


oz) => (7) 
- Layer 3 (output layer): 
Yp = V(b? + Wh?) (8) 


where ¥ is the output layer activation function and y, = [out, out, out]? denotes the network output 
vector. In this work, the output vector dimension is (3(2)x1), W? is an (3(2)x3) matrix, b? is an (3(2)x1) 
vector and the linear activation (i.e., no activation function) is chosen as activation function. 

The next step consists of initializing the deep feedforward network weights and biases. The initial 
training of DFN parameters is the process that permits finding the weights and bias values for all network 
layers to map the input data to the associated output. This training operation needs running several iterations 
making minor changes to the DFN parameters until a minimum of a cost function is reached. Many 
algorithms can be used, such as Levenberg-Marquardt (LM) and the backpropagation algorithms that use 
Jacobian derivatives, known as the fastest algorithm [34]. Roughly, the backpropagation training function 
could use the gradient derivatives, such as gradient descent (GD), the Broyden, Fletcher, Goldfarb, and 
Shanno (BFGS) quasi-newton method, or the one-step secant (OSS) algorithm. In this work, after several 
tests, we have chosen the one-step secant (OSS) algorithm as it was shown to ensure satisfying results using 
less memory and computational time compared with the BFGS algorithm [35]. 

In this paper, the APIDDFN is carried out to stabilize the quadrotor system for trajectory tracking 
objective. The APIDDEN control method comprises the conventional PID control strategy and the deep 
feedforward network (DFN) algorithm. Especially for the quadrotor system, six APIDDFNs controllers 
denoted as APDDFN,, APDDFN,, APIDDFN,, APIDDFNg, APIDDFNg and APIDDFNy, are designed. 
Thus, six DFNs controllers are used to online adjusting the gains of the PID (PD) controllers for z, o, 0, and 
w (x and y). Figure 3 shows the structure of the APIDDFN, for the z system-state, where U, is the controller 
output, Z is the current system output, and e, is the difference between the desired trajectory Zg., and its 
actual value z. For clarity, a global control system schema is provided in Figure 4. 
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Input layer Hidden layer 1 Hidden layer 2 Output layer 
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APIDDFNpitch controller 


Reference trajectory APIDDFNyaw controller 


Quadrotor Plant 


Figure 4. Global control system schema 


Besides, once the network architecture has been chosen, the weights WE and biases b* should be 
trained in each layer, for k € {1,2,3}. Appropriate values are obtained for these parameters by minimizing a 
cost function that computes the difference between the predicted output obtained and the actual output value. 
Then, without a loss of generality, we choose the integral of time-weighted absolute error (ITAE) as a 
performance criterion in the considered control problem. Thus, letting e(t) denote the error between the 
target value of the output and the actual output value, the ITAE performance criterion is defined by (9): 


ITAE = f, tle(t)|dt (9) 


The next step consists of adjusting the weights and bias to better dynamic behavior for the 
closed-loop quadrotor system until the chosen cost function reaches a minimal value. Thus, we use the 
multidimensional PSO algorithm to find the 33 (29) DFN parameters’ optimum values. Indeed, PSO is a 
population-based stochastic optimization algorithm that has experienced many enhancements [36]. Thus, 
PSO has been extensively utilized to solve optimization problems, where it has proven its effectiveness in 
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different research areas [37]. To describe the specific DFN parameter tuning process, algorithm 2 gives the 
multidimensional PSO algorithm main steps. Each step of algorithm 2 may be described as follows: 

First, the PSO algorithm parameters should be selected. Practically, to tune the optimum value of the 
33 (29) DEN parameters, the population number n is set to 200, and the maximum iteration is set to 5. 
However, the right choice of the search space parameter (ss), which defines the search space limits during 
the tuning process, allows computing the parameters’ maximum possible values to be tuned. For this reason, 
we choose a different search space parameter for each system state. Then, the PSO algorithm is initialized 
with random particles that correspond to the DFN weights and bias parameters’ possible values. In steps 3 
and 4, the particles are evaluated to compute the local best fitness y1(t), which is known as the cognitive 
component, and the global best fitness x(t), which is known as the social component. In step 5, the PSO 
algorithm computes the values of actual position y(t) and actual velocity v(t) of the particles, which affects 
the movement of the particles according to (10): 


he +1) = y vt) + an E) - xO) + anv’? - xO) (10) 
x(t + 1) = y(t) + v(t +1) 

where y is the inertia weight used to adjust the impact of the previous velocity value upon the current one. 
(c1, c2) numbers are the cognitive and social constants, and (7,72) numbers are arbitrarily chosen in [0,1]. 
Afterward, if the maximum iteration number is not yet reached, the algorithm returns to step 2. Otherwise, 
algorithm 2 provides the optimum values of DFN parameters. 


Algorithm 2: Multidimensional PSO algorithm for the DFN 
Begin 
1 Set the PSO parameters (population number, search space, maximum of iteration ...) 
Loop: 
Initialize random particles (DFN weights and bias) 
Evaluate the local best fitness for each particle 
Evaluate the global best fitness 
Update the velocity and position of particles 
Until maximum iteration is reached 


AUN 


3. RESULTS AND DISCUSSION 

The completely dynamic control system setup is implemented using the MATLAB simulation tool 
to test and show the proposed controller’s validity for the quadrotor aerial vehicle. First, the feedforward 
network (FN) and the DFN structures adjust the PID/PD controller gains for each quadrotor system state. 
Then, the elaborated final DFN weights and biases are optimized using the PSO algorithm. Besides, a 
comparison between all the above controllers is provided to show the controller’s effectiveness. Finally, to 
highlight the proposed PSO-based APIDDFN controller validity, an efficiency test is performed. The 
quadrotor aerial vehicle’s payload is assumed to be changing during its motions and subjected to the Dryden 
turbulence wind. 

The quadrotor parameters are chosen similar to those used in [38]. Then, the proposed controller is 
conducted to stabilize the quadrotor attitude and ensure the following x, y, and z trajectory-tracking: 


1 te [25s 55s] 


1 te [45s 75s] A =f} te [5s 95s] 
0 otherwise 


0 otherwise ? °* 0 otherwise 


Xdes = { » Ydes = { (11) 
Then, we aim to show the proposed APIDDEN controller effectiveness compared to three other control 
methods implemented to stabilize the quadrotor system using the following simulation scenario. 


3.1. Quadrotor control by the conventional PID controller 

The first control method consists of controlling the quadrotor system by a fixed-parameter PID/PD 
controller. Table 1 summarizes the PID/PD gains obtained for each system state using the MATLAB PID 
Tuner. Thus, to design the FN and DFN systems, the training dataset is collected from error and output 
values when the above-tuned PID controller controls the quadrotor system. 


3.2. Quadrotor control by the adaptive PID feedforward network 

The second method consists of controlling the quadrotor by the APIDDFN, where six APIDFN 
controllers are implemented. Each APIDFN controller comprises a PID and a feedforward network (FN) 
algorithm containing three layers: one input layer, one hidden layer, and one output layer. The one-step 
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secant (OSS) algorithm is used to initialize the FN weight and bias. Table 2 summarizes the 21(17) FN 


parameters obtained for z, p, 0, and w (x and y) quadrotor system states. 


Table 1. PID/PD controller initial gains 


x y Z 0 
Kp 01 01 75 0.2339 0.2339 0.00274 
Ki -- -- 3.075 0.0929 0.0929 0.0001 
Kd 0.2 02 5.5 0.08638 0.08638 0.1 
Table 2. FN system parameters 
wt bt w2 b? 
TAIS SLAIT 2.7265 —0.1914 —0.0024 —0.0470 0.2428 
x —1.0460 2.1203 0.0176 er 0000049 poe 
1.8570 1.5275 2.4471 : ' i 
70:1625 = 2.1334 FARA 0.0856 0.0326 0.2804 —0.1916 
y T0022 eA Te —0.1094 —0.0268 —0.0232 [zo 0954 
1.0827 —1.4031 2.7171 l ; i : 
1.8357 —1.3429 2.6015 -0.1721 0.0042 —0.3515 0.5038 
z - 1.4657 -19203 -o oal | 0.4011 —0.0161 -011328| |=03017] 
2.2023 0.2091 2.6620 0.1481 —0.0101 —0.1933 0.0980 
1.9734  —1.4623 —2.3548 0.0567 0.0694 —0.5032 0.5841 
o | 1.8060 -1391| o. oasa |=23739 0.1919 =09959 f 0.5969 | 
2.3032 —0.9046 2.3236 0.8101 0.1823 -0.3795 0.4967 
1.3642 1.8727 2.5868 —0.8604 0.1266 0.3287 0.5630 
6 f 0.8303 2.2216 | [i 0.0926 | | 0.1905 0.0896 0.7230 | [i 0.8973 | 
1.5594 —1.7780 2.4978 0.6688 —0.1112 —0.8360 0.1628 
1.8379 0.9877 2.7657 —0.1470 —0.0861 —0.2114 0.2902 
w f 2.1095 -0542| [i 0.1496 | | 0.0432 —0.0209 Zooss] [f 0.1077 | 
1.6207 1.7556 2.3730 —0.0323 0.0070 0.0184 0.0518 


3.3. Quadrotor control by the adaptive PID deep feedforward network 

The third method consists of controlling the quadrotor system using the APIDDFN that comprises 
one additional hidden layer than the APIDFN. The OSS algorithm is also used as a training method for 
initializing the DFN parameters. Table 3 provides the 33 (29) DFN parameters for z, p, 0, and wW (x and y) 
quadrotor system state. 


Table 3. DFN system parameters 


wt w? w? b? 
-1.2635 -1.8703 2 a -1.0528 -1.0635 1.0501 2.1729 

s [Ess rasse] fo a [i403 oaos 10004 P [06447 -08081 -09887 [0.2324 
1.3386 1.7651 2.6482 -0.2118 1.2808 1.3327 2.1444 3 ` i ' 
2.2750 0.8316 2.4105 1.5614 -1.2710 0.0251 2.2945 

y [Ess 172s) [oss| [izon 03653 tao] foo) [are Gaur? oa a 
0.9982 -1.9617 2.7044 -0.1804 -1.1829 0.8886. 2.8061 : ' 
1.9363 -1.4086 2.4772 0.5245 1.2985 1.4103 2.0345 0.4751 0.1462 -0.1361 0.4341 

Zz | 1.8222 1. 4957| [e 0608 | 0.8245 -0.9529 48775 [e 1698] | 0.3874 0.3852 20725 | 0. oo| 
0.1388 2.1700 2.6938. 0.6241 1.4304 0.9536 2.2061 0.7497 0.3510 -0.0788. 0.5572 
2.3148 -0.7212 2.4183 -0.3337 -1.5494 0.0845 2.4315 0.4162 -0.1092 0.5896 0.0923 

(0) [e 4887 -2. 3122] : 0. 2637] [02354 -1.3129 12057] [e ooe] | 0.2739 0.0601 2.0620 | [e osc 
2.2385 -0.9351 2.4162 -0.0742 -1.9902 -0.5102 1.9356. 0.3692 -0.3557 -0.3936 0.4138 
1.0699 2.0483 2.5645 1.4605 0.0494 -1.4034 1.9564 0.0938 0.3084 0.0920 0.0978 

0 | 2.2879 -0. 7388] [e 0132| | 1.3389 -0.8714 "1.0487 [e 024o [e 5909 -0.4233 0.2745 | Jesse 
1.6393 -1.7824 2.4319. -1.0406 0.0780 -1.7284 2.0190. 0.6534 -0.3072 -0.5427 0.2059. 
1.3434 -1.6871 2.7498 1.9468 -0.0490 -0.4695 1.9837 0.4094 -1.3674 -0.2009 -0.7306 

p 0 9313 2. 2056 [e 0267 | 0.9199 0.0671 12957| | 1. 3400] | 0.0300 0.1989 20365| [0:1025] 
0.9701 3.3114 1.2526 0.8037 -0.3096 -1.9388. 1.6927 0.0016 0.7188 0.1032 0.6457 


3.4. Quadrotor control by the PSO-based adaptive PID deep feedforward network 


The fourth method consists of controlling the quadrotor system using the APIDDFN controller. In 
this case, the control parameters are optimized using the PSO algorithm (PSO-APIDDFN). For clarity, 
Table 4 shows the new 33(29) PSO-tuned DFN parameters for each quadrotor system state. 
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Table 4. DEN system parameters tuned by the multidimensional PSO algorithm 


w+ w? w? b? 


0.3320 4.0000 3.6509 0.2276 2.6606 —2.0000 2.1355 
s [Foases sapeo] EA [-a7656 “07008 -rerl |- E [05444 0.0178 01341] [14377 
1.2157 1.5322 2.2516 .2548 0.2497 0.7498 1.6769 ` ` 
0.0021 1.6944 3.5567 0.5380 1.1247 2.0000 1.0000 
y (2er Sissel |-osrs7] |-27os3 3204 oaa) Coss] [uen nae 09632] BR 
—0.0741 1.3784 2.7979 0.0401 —0.0941 0.3346 2.0174 ` j i 
0 1.2998 2.5406 —0.7441 1.1434 1.8354 —1.9879 1.0000 1.4376 —1.1577 0.8648 
Zz —0.8544 3.0684 1.1023 0.7360 —1.0000 —0.3674 0.9285 0.1954 —0.9893 —0.5038 1.4695 
1.4629 1.1332 4.0000. 0.3268 2.1546 1.4486 —2.8496 —2.0000 0.7333 —1.7944 —0.2768 
—2.0000 —1.7835 4.0000 —0.5467 —0.3974 2.9705 0.5489 —0.1122 —0.1534 1.1646 
p —1.5054 —2.4023 —0.3487 0. S —0.6741 —1.6522 1.0000 —0.1632 0.8813 0.2801 1.4386 
—0.4962 0.4424 —3.1380. —3.0000 —0.8071 1.0936 —2.9764 2.0000 2.0000  —1.0000. 1.0000 
0 —0.5770 —2.3539 2.0000 1.8656 —0.7121 —3.0216 0.9232 0.0311 —0.4625 —1.0655 
0 —0.3557 3.6338 0.0436 0.2626 1.3522 0.7005 1. Pa 2.0000 —0.3400 0.0296 1.0000 
—1.2773 —2.0000. —3.8555 1.5814 1.3641 3.0000. —0.7128 2.0000 2.0000 2.0000 
1.9006 —0.6465 —2.3242 1.8107 1.3353 —1.5015 —0.9729 —0.6805 —1.2258 0.4413 1.2186 
p —2.3708 1.0607 —0.3009 1.6569 1.6522 —0. p —0.0321 0.9313 —0.7401 —0.4986 0.6912 
—1.2146 1.5179 —1.0000 —0.7715 1.0510 2.8002 —0.2053 0.1624 0.9461 2.0000. 


3.5. Controllers’ performance comparison 

It is worth noticing that the quadrotor linear acceleration involves the total thrust force and the Euler 
angle states. Then, it is challenging to stabilize this system in an open-loop scheme [39]. For this reason, the 
quadrotor system should be stabilized using a suitable control strategy. Besides using the PSO algorithm, the 
proposed control method allows better system performance in the closed-loop. 

The time response for linear positions is shown in Figure 5. The performance characteristics 
comparison such as percent of overshoot (Mp), rise time (T,), settling time (Ts), and ITAE criterion between 
these four control strategies is summarized in Table 5. It can be seen from Figure 5 and Table 5 that the 
APIDDFN is slightly better than the APIDFN. Thus, it is evident that using additional hidden layers has 
improved the control system’s performance. Besides, except for the rise time of the z and x system states, the 
PSO-APIDDEN controller shows better performances, a faster convergence speed, a minimum overshoot, 
and a minor ITAE performance criterion compared to the other controllers 

Then, Figure 6 shows the g, 0, and w quadrotor attitudes for the PID, APIDFN, APIDDEFN, and 
PSO-APIDDEN controllers. It is shown that the four control methods can ensure the stabilization of the 
quadrotor attitude. Roughly, g and @ angle values do not exceed 4 degrees for all these control strategies as 
shown in Figures 6(a) and 6(b). Besides, the w-yaw angle value is almost null, especially for the PSO- 
APIDDEN controller as shown in Figure 6(c). 


Table 5. Control system performance comparison 


x y Zz 
M,(%) T,(s)__T,(s)__ITAE _M,(%) _T,(s) _T,(s)__ITAE M,(%) Ts) _T,(s)__ITAE 
PID 9.432 1.763 7376 0.491 9432 1.763 7.376 0.722 9.975 0.6103 3.613 0.144 
APIDFN 8.719 1.770 7.292 0.481 8.845 1.780 7.088 0.699 9.916 0.6102 3.609 0.143 
APIDDFN 7.890 1.702 6.355 0.395 8.671 1.776 7.277 0.709 9.701 0.61 3.574 0.144 


PSO-APIDDFN 1.026 1.751 2.968 0.075 1.134 1.729 2.929 0.111 1.529 0.6923 2.144 0.0523 


3.6. Efficiency test 

The simulation tests are also performed to illustrate the above controller efficiency when the 
quadrotor system is subject to external disturbances. Roughly, two types of disturbances are considered: a 
quadrotor mass change and external wind. First, Figure 7 shows the case of 20% mass increasing between 
10 s and 30 s, and decreasing, between 60 s and 80 s as shown in Figure 7(a), and its effect on the vehicle 
z-altitude as shown in Figure 7(b). The total mass change caused a slight overshoot in the PSO-APIDDFN 
(about 5%) than the other controllers (about 16.6%). Thus, the proposed controller can compensate for the 
mass change by changing the PID/PD gains. Then, Figure 8 illustrates the z-altitude PID gains variation in 
the case of APIDFN as shown in Figure 8(a), APIDDFN as shown in Figure 8(b), and PSO-APIDDFN as 
shown in Figure 8(c) controllers when the quadrotor system is subject to a mass change. These subfigures 
show the adaptive behavior of the three controllers. However, unlike the PID controller gains that remain at 
the same value in all simulation time, the gains elaborated in the PSO-APIDDFN controller change according 
to the behavior of the quadrotor system outputs. 
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Secondly, we simulate the effect of the external wind on the quadrotor system translational motion. 
This disturbance is conceived using the above Dryden model, where the wind speed is 20 m/s. Figure 9 
shows the trajectory tracking of the quadrotor system under the wind. The above results show that the 
PSO-APIDDEFN controller has an excellent wind disturbance-rejection compared to the other control 
strategies, for x-state as shown in Figure 9(a), y-state as shown in Figure 9(b), and z-state as shown in 


Figure 9(c). 
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Figure 5. x, y, and Z linear positions time responses 
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Figure 6. Euler angles stabilization a) p-roll, b) @-pitch, and c) w-yaw 
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Figure 8. Variation of altitude PID gains in case of (a) APIDEN, (b) APIDDEFN, and (c) PSO-APIDFN 
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Figure 9. Effect of the wind on the translational motion: (a) x-position, (b) y-position, and (c) z-altitude 


4. CONCLUSION 
In this paper, an adaptive proportional integral derivative deep feedforward network (APIDDFN) 


controller is proposed for stabilizing the trajectory tracking control of an underactuated quadrotor system. 
The deep feedforward neural network (DFN) is used to online adjusting the PID controller parameters. A 
one-step secant algorithm is utilized to initialize the DFN system weights and biases. Besides, a 
multidimensional particle swarm optimization (PSO) algorithm is utilized to optimize the DFN parameters. 
The simulation results illustrate that the proposed PSO-APIDDEN controller can achieve faster convergence 
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speed and show minimum overshoot than the non-optimized adaptive PID feedforward network and adaptive 
PID deep feedforward network. These results demonstrate the efficiency of the proposed method in designing 
an intelligent controller for stabilizing the quadrotor UAV. Future works are related to integrating an 
observer in the control system to estimate unmeasured translational and rotational quadrotor system states 
and rates. 
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